Modelling pronominal anaphora in statistical machine translation

نویسندگان

  • Christian Hardmeier
  • Marcello Federico
چکیده

Current Statistical Machine Translation (SMT) systems translate texts sentence by sentence without considering any cross-sentential context. Assuming independence between sentences makes it difficult to take certain translation decisions when the necessary information cannot be determined locally. We argue for the necessity to include crosssentence dependencies in SMT. As a case in point, we study the problem of pronominal anaphora translation by manually evaluating German-English SMT output. We then present a word dependency model for SMT, which can represent links between word pairs in the same or in different sentences. We use this model to integrate the output of a coreference resolution system into English-German SMT with a view to improving the translation of anaphoric pronouns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Pronominal Divergence and Anaphora Resolution in English-Hindi Machine Translation

So far the majority of Machine Translation (MT) research has focused on translation at the level of individual sentences. For sentence level translation, Machine Translation has addressed various divergence issues for large variety of languages; the issue of pronominal divergence has been presented only recently. Since the quality of translation as required by users follows coherent multi-sente...

متن کامل

Pronominal Anaphora in Machine Translation

State-of-the-art machine translation systems use strong assumptions of independence. Following these assumptions language is split into small segments such as sentences and phrases which are translated independently. Natural language, however, is not independent: many concepts depend on context. One such case is reference introduced by pronominal anaphora. In pronominal anaphora a pronoun word ...

متن کامل

Proposal of an English-Spanish Interlingual Mechanism Focused on Pronominal Anaphora Resolution and Generation in Machine Translation Systems

In this paper an interlingual mechanism oriented to pronominal references resolution and generation in Machine Translation (MT) systems is proposed. This mechanism is based on Slot Structure (SS) presented in [3] [2]. A comparison of pronominal references resolution both in English and in Spanish is developed to accomplish a study of the existing discrepancies between two languages. From this s...

متن کامل

Exploring Semantic Information from Hindi Dependency Treebank for Resolving Pronominal Anaphora

Anaphora Resolution is exigent task in almost all NLP applications such as text summarization, machine translation, information extraction, question-answering systems, etc. A lot of work has been done for identifying and still more need to be done for finding the factors responsible for resolving the anaphoras in all languages. An attempt has been made to resolve Hindi pronominal anaphora using...

متن کامل

Coreference-Oriented Interlingual Slot Structure And Machine Translation

One of the main problems of many commercial Machine Translation (MT) and experimental systems is that they do not carry out a correct pronominal anaphora generation. As mentioned in Mitkov (1996), solving the anaphora and extracting the antecedent are key issues in a correct translation. In this paper, we propose an Interlingual mechanism that we have called lnterlingual Slot Structure (ISS) ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010